E. coli

MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFSGTRG
B. subtilis

MSYQALYRVFRPQRFEDVVGQEHITKTLQNALLQKKFSHAYLFSGPRGTSKT



E. coli 

GLNCETGITATPCGVCDNCREIEQGRFVDLIEIDAASRTKVEDTRDLLDNVQYAPARGRF 
B. subtilis 

AVNCEHAPVDEPCNECAACKGITNGSISDVIEIDAASNNGVDEIRDIRDKVKFAPSAVTY



E. coli 

KVYLIDEVHMLSRHSFNALLKTLEEPPEHVKFLLATTDPQKLPVTILSRCLQFHLKALDV
B. subtilis 

KVYIIDEVHMLSIGAFNALLKTLEEPPEHCIFILATTEPHKIPLTIISRCQRFDFKRITS
TCCGGGGGTG GGGTTCCCAG GTAGACCCCG GCCCCTCCCG TGAGCCCCTT TACCCAGGCC 60

GCCACCTCCT CCAGGGGGGC CAAGGCGTGC AAGGAGAGGA ACGTCCGCAC CACGCCCTAT 120 

S.D. 
CGACCTCGGA CAAGAGACCG TGGACAACAT CCTCAAGCGC CTCCGCCGTA

GGTGCGGGGG CTCCAGAAGA TGGTGGCCGA GGGCCGCCCC TGCGACGAGG TCCTCACCCA 1880 

GATGACCGCC ACCAAGAAGG CCATGGAGGC GGCGGCCACC CTGATCCTCC ACGAGTTCCT 1940 

GAACGTCTGC GCCGCCGAGG TCTCCGAGGG CAAGGTGAAC CCCAAGAAGC CCGAGGAGAT 2000

CGCCACCATG CTGAAGAACT TCATCTA 2027
GTG AGC GCC CTC TAC CGC CGC TTC CGC CCC CTC ACC TTC CAG GAG GTG GTG 51

GGG CAG GAG CAC GTG AAG GAG CCC CTC CTC AAG GCC ATC CGG GAG GGG AGG CTC GCC CAG 111 

GCC TAC CTC TTC TCC GGG CCC AGG GGC GTG GGC AAG ACC ACC ACG GCG AGG CTC CTC GCC 171 

ATG GCG GTG GGG TGC CAG GGG GAA GAC CCC CCT TGC GGG GTC TGC CCC CAC TGC CAG GCG 231 

GTG CAG AGG GGC GCC CAC CCG GAC GTG GTG GAC ATT GAC GCC GCC AGC AAC AAC TCC GTG 291

GAG GAC GTG CGG GAG CTG AGG GAA AGG ATC CAC CTC GCC CCC CTC TCT GCC CCC AGG AAG 351 

GTC TTC ATC CTG GAC GAG GCC CAC ATG CTC TCC AAA AGC GCC TTC AAC GCC CTC CTC AAG 411 

ACC CTG GAG GAG CCC CCG CCC CAC GTC CTC TTC GTC TTC GCC ACC ACC GAG CCC GAG AGG 471 

ATG CCC CCC ACC ATC CTC TCC CGC ACC CAG CAC TTC CGC TTC CGC CGC CTC ACG GAG GAG 531 

GAG ATC GCC TTT AAG CTC CGG CGC ATC CTG GAG GCC GTG GGG CGG GAG GCG GAG GAG GAG 591 

GCC CTC CTC CTC CTC GCC CGC CTG GCG GAC GGG GCC CTT AGG GAC GCG GAA AGC CTC CTG 651 

GAG CGC TTC CTC CTC CTG GAA GGC CCC CTC ACC CGG AAG GAG GTG GAG CGC GCC CTA GGC 711 

TCC CCC CCA GGG ACC GGG GTG GCC GAG ATC GCC GCC TCC CTC GCG AGG GGG AAA ACG GCG 771 

GAG GCC CTG GGC CTC GCC CGG CGC CTC TAC GGG GAA GGG TAC GCC CCG AGG AGC CTG GTC 831 

TCG GGC CTT TTG GAG GTG TTC CGG GAA GGC CTC TAC GCC GCC TTC GGC CTC GCG GGA ACC 891 

CCC CTT CCC GCC CCG CCC CAG GCC CTG ATC GCC GCC ATG ACC GCC CTG GAC GAG GCC ATG 951 

GAG CGC CTC GCC CGC CGC TCC GAC GCC TTA AGC CTG GAG GTG GCC CTC CTG GAG GCG GGA 1011 

AGG GCC CTG GCC GCC GAG GCC CTA CCC CAG CCC ACG GGC GCT CCT TCC CCA GAG GTC GGC 1071 

CCC AAG CCG GAA AGC CCC CCG ACC CCG GAA CCC CCA AGG CCC GAG GAG GCG CCC GAC CTG 1131 

CGG GAG CGG TGG CGG GCC TTC CTC GAG GCC CTC AGG CCC ACC CTA CGG GCC TTC GTG CGG 1191 

GAG GCC CGC CCG GAG GTC CGG GAA GGC CAG CTC TGC CTC GCT TTC CCC GAG GAC AAG GCC 1251

TTC CAC TAC CGC AAG GCC TCG GAA CAG AAG GTG AGG CTC CTC CCC CTG GCC CAG GCC CAT 1311 

TTC GGG GTG GAG GAG GTC GTC CTC GTC CTG GAG GGA GAA AAA AAA AGC CTG AGC CCA AGG 1371 

CCC CGC CCG GCC CCA CCT CCT GAA GCG CCC GCA CCC CCG GGC CCT CCC GAG GAG GAG GTA 1431 

GAG GCG GAG GAA GCG GCG GAG GAG GCC CCG GAG GAG GCC TTG AGG CGG GTG GTC CGC CTC 1491 

CTG GGG GGG CGG GTG CTC TGG GTG CGG CGG CCC AGG ACC CGG GAG GCG CCG GAG GAG GAA 1551 
CCC CTG AGC CAA GAC GAG ATA GGG GGT ACT GGT ATA TAA (1590) 
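The codon-spaced listing above can be checked against a three-letter translation mechanically. The following is a minimal sketch, assuming the standard genetic code and assuming that an initiator GTG (or TTG) is read as Met, as is usual for bacterial start codons. The function name `translate` and its parameter are illustrative and not part of the source.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order ('*' marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
}

def translate(dna, initiator_as_met=True):
    """Translate a DNA listing into three-letter residues, stopping at the first stop codon.

    Spaces, line breaks, and running position numbers are ignored.
    """
    seq = "".join(ch for ch in dna.upper() if ch in "ACGT")
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # Assumption: an alternative start codon in first position encodes Met.
        if i == 0 and initiator_as_met and codon in ("ATG", "GTG", "TTG"):
            residues.append("Met")
        else:
            residues.append(THREE_LETTER[aa])
    return residues

print(" ".join(translate("GTG AGC GCC CTC TAC CGC CGC TTC 51")))
```

Because the position numbers at the line ends are stripped before translation, whole lines of the listing can be pasted in unchanged.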
Met ser ala leu tyr arg arg phe arg pro leu thr phe gln glu val val gly gln glu 20
his val lys glu pro leu leu lys ala ile arg glu gly arg leu ala gln ala tyr leu 40
phe ser gly pro arg gly val gly lys thr thr thr ala arg leu leu ala met ala val 60
gly cys gln gly glu asp pro pro cys gly val cys pro his cys gln ala val gln arg 80
gly ala his pro asp val val asp ile asp ala ala ser asn asn ser val glu asp val 100
arg glu leu arg glu arg ile his leu ala pro leu ser ala pro arg lys val phe ile 120
leu asp glu ala his met leu ser lys ser ala phe asn ala leu leu lys thr leu glu 140
glu pro pro pro his val leu phe val phe ala thr thr glu pro glu arg met pro pro 160
thr ile leu ser arg thr gln his phe arg phe arg arg leu thr glu glu glu ile ala 180
phe lys leu arg arg ile leu glu ala val gly arg glu ala glu glu glu ala leu leu 200
leu leu ala arg leu ala asp gly ala leu arg asp ala glu ser leu leu glu arg phe 220
leu leu leu glu gly pro leu thr arg lys glu val glu arg ala leu gly ser pro pro 240
gly thr gly val ala glu ile ala ala ser leu ala arg gly lys thr ala glu ala leu 260
gly leu ala arg arg leu tyr gly glu gly tyr ala pro arg ser leu val ser gly leu 280
leu glu val phe arg glu gly leu tyr ala ala phe gly leu ala gly thr pro leu pro 300
ala pro pro gln ala leu ile ala ala met thr ala leu asp glu ala met glu arg leu 320
ala arg arg ser asp ala leu ser leu glu val ala leu leu glu ala gly arg ala leu 340
ala ala glu ala leu pro gln pro thr gly ala pro ser pro glu val gly pro lys pro 360
glu ser pro pro thr pro glu pro pro arg pro glu glu ala pro asp leu arg glu arg 380
trp arg ala phe leu glu ala leu arg pro thr leu arg ala phe val arg glu ala arg 400
pro glu val arg glu gly gln leu cys leu ala phe pro glu asp lys ala phe his tyr 420
arg lys ala ser glu gln lys val arg leu leu pro leu ala gln ala his phe gly val 440
glu glu val val leu val leu glu gly glu lys lys ser leu ser pro arg pro arg pro 460
ala pro pro pro glu ala pro ala pro pro gly pro pro glu glu glu val glu ala glu 480
glu ala ala glu glu ala pro glu glu ala leu arg arg val val arg leu leu gly gly 500
arg val leu trp val arg arg pro arg thr arg glu ala pro glu glu glu pro leu ser 520
gln asp glu ile gly gly thr gly ile 529
Met ser ala leu tyr arg arg phe arg pro leu thr phe gln glu val val gly gln glu 20
his val lys glu pro leu leu lys ala ile arg glu gly arg leu ala gln ala tyr leu 40
phe ser gly pro arg gly val gly lys thr thr thr ala arg leu leu ala met ala val 60
gly cys gln gly glu asp pro pro cys gly val cys pro his cys gln ala val gln arg 80
gly ala his pro asp val val asp ile asp ala ala ser asn asn ser val glu asp val 100
arg glu leu arg glu arg ile his leu ala pro leu ser ala pro arg lys val phe ile 120
leu asp glu ala his met leu ser lys ser ala phe asn ala leu leu lys thr leu glu 140
glu pro pro pro his val leu phe val phe ala thr thr glu pro glu arg met pro pro 160
thr ile leu ser arg thr gln his phe arg phe arg arg leu thr glu glu glu ile ala 180
phe lys leu arg arg ile leu glu ala val gly arg glu ala glu glu glu ala leu leu 200
leu leu ala arg leu ala asp gly ala leu arg asp ala glu ser leu leu glu arg phe 220
leu leu leu glu gly pro leu thr arg lys glu val glu arg ala leu gly ser pro pro 240
gly thr gly val ala glu ile ala ala ser leu ala arg gly lys thr ala glu ala leu 260
gly leu ala arg arg leu tyr gly glu gly tyr ala pro arg ser leu val ser gly leu 280
leu glu val phe arg glu gly leu tyr ala ala phe gly leu ala gly thr pro leu pro 300
ala pro pro gln ala leu ile ala ala met thr ala leu asp glu ala met glu arg leu 320
ala arg arg ser asp ala leu ser leu glu val ala leu leu glu ala gly arg ala leu 340
ala ala glu ala leu pro gln pro thr gly ala pro ser pro glu val gly pro lys pro 360
glu ser pro pro thr pro glu pro pro arg pro glu glu ala pro asp leu arg glu arg 380
trp arg ala phe leu glu ala leu arg pro thr leu arg ala phe val arg glu ala arg 400
pro glu val arg glu gly gln leu cys leu ala phe pro glu asp lys ala phe his tyr 420
arg lys ala ser glu gln lys val arg leu leu pro leu ala gln ala his phe gly val 440
glu glu val val leu val leu glu gly glu lys lys lys ala 454
ATP site

E.coli MSYQVLARKWRPQTFADVVGQEHVLTALANGLSLGRIHHAYLFS GTRGVGKT SIARLLAK 60

H.inf. K II KDN.L F. . 60

B.sub. A.Y.VF. . .R.E ITKT . Q . A . LQKKFS P. .T A.KIF. . 60

C. cres. DA.T Y. .R. .E.LI. . .AMVRT. . .AF.T. , .A. .FMLT.V TT R 113

M.gen. -MH. .FYQ.Y. .IN.KQTL. . . SIRKI . V. AINRDKLPNG . I . . .E. .T. . .TF.KII. . 59 

T.th. — VSA.Y.RF. .L. .QE KEP.LKAIRE, . LAQ P TT M 58 

Zn++ finger
* * * * 

E.coli GLNCET GITATPCGVCDNCREIEQGRFVDLIEIDAASRTKVEDTRDLLDNVQYAPA 116 

H.inf . . . . .VH .V E.E..KA....N.I E K.V 116 

B. sub. AV. . .H APVDE. . NE . AA . KG . TN . SIS . V NNG . DEI . .IR.K.KF, .S 116 

C. cres. A. .Y. . DTVK . PSVDLTTEGYH . . S . IE . .HM.VL.L DEM.E. . .G.R. . .V 173 

M.gen. AI . . LN WDQIDV.NS. . V . KS . NTNSAI . IV KNGIN . I . E . VE . . FNH . F 115 

T.th AVG.QG EDP PH . QAVQR . AHP . WD NNS . . .V.E.RERIHL. .L 112 

E.coli RGRFKVYLIDEVHMLSRHSFNALLKTLEEPPEHVKFLLATTDPQKLPVTILSRCLQFHLK 176

H.inf. V Y 176

B.sub. AVTY. . .I IGA CI . I E . H . I . L . . I . . . QR . DF . 176

C. cres. EA . Y . . . I TAA P.A..IF...EIR.V QR.D.R 233

M.gen. TFKK. . .IL. .A. . .TTQ.WGG S.PY.L.IFT. . EFN . I . L QS . FF . 175

T.th. SAPR. .FIL. .A. . . .KSA P. .L.VF. . .E.ERM.P TQH . RFR 172

E.coli ALDVEQIRHQLEHILNEEHIAHEPRALQLLARAAEGSLRDALSLTDQAIASGDGQ--VST 234

H.inf. . . . ET . . SQH .A...TQ.N.PF.DP..VK..K..Q..I..S M..R.--.TN 234

B.sub. RITSQA.VGRMNK.VDA.QLQV.EGS.EII.S. .H.GM L. . . .SFSGDI--LKV 234

C. cres. RVEPDVLVKHFDR . SAK . GARI . MD . .A.I V..G...L... .VQTERGQT.TS 293

M.gen. KITSDL . LER . ND . AKK . K . KI . KD . . IKI . DLSQ G . . . L . . LAI . LIVKKL . LL 235

T.th. R.TE.E.AFK.RR. . EAVGREA . EE . .L. . . .L.D.A. . . .E. . LERFLLLEGP LTR 229

E.coli QAVSAMLGTLDDDQALSLVEAMVEANGERVMALINEAAARGIEWEALLVEMLGLLHRIAM 294

H.inf. NV. .N. . .L. . . NYSVDILY . LHQG . . . LL . RTLQRV. DAAGD . DK . . G . CAEK , .Q. .L 294

B.sub. EDALLIT . AVSQLYIGK . AKSLHDK . VSDALETL . . LLQQ . KDPAK . IED . IFYFRDMLL 294

C. cres. TV . RD . . . LA . RS . TIA . Y . HVMAGKTKDALEGFRALWGF . ADPAWMLDV . DHC . AS . V 353

M.gen. MLKKHLISLIEMQNL . L . KQFYQ .1 260

T.th. KE . ERA . . SPPGTGVAEIAASLARGKTAEALG . ARRLYGE . YAPRS . VSGL . EVFREGLY 289
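Rows of this kind can be expanded back into full sequences once the OCR spacing has been removed. The sketch below assumes the common alignment convention, which the figure itself does not state, that `.` marks a residue identical to the reference (E. coli) row at the same column and `-` marks a gap. The function name `expand_identity_dots` is illustrative.

```python
def expand_identity_dots(reference, row):
    """Replace each '.' in an aligned row with the reference residue at that column.

    Both strings must already be aligned to the same length; gaps ('-') and
    explicit residues in the row are kept as they are.
    """
    if len(reference) != len(row):
        raise ValueError("rows must be aligned to the same length")
    return "".join(ref if ch == "." else ch for ref, ch in zip(reference, row))

# Nine-residue stretch where the E. coli and B. subtilis rows differ at one position.
print(expand_identity_dots("KVYLIDEVH", "...I....."))
```

The length check matters here: rows whose dots were dropped or merged during extraction no longer align column by column and would need to be repaired first.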
A) Alignment of TTH1 with alpha subunits of other organisms.

E.coli DRYFLELIRTGRPDEESYLHAAVELAEARGLPVV 197 (ID#72)

V.chol. DHFYLELIRTGRADEESYLHFALDVAEQYDLPVV 197 (ID#73)

H.inf. DHFYLALSRTGRPNEERYIQAALKLAERCDLPLV 197 (ID#74)

R.prow. DRFYFEIMRHDLPEEQFIENSYIQIASELSIPIV 195 (ID#75)

H.pyl. DDFYLEIMRHGILDQRFIDEQVIKMSLETGLKII 213 (ID#76)

S.sp. DDYYLEIQDHGSVEDRLVNINLVKIAQELDIKIV 202 (ID#77)

M.tub. DNYFLELMDHGLTIERRVRDGLLEIGRALNIPPL 220 (ID#78)

T.th. FFIEIQNHGLSEQK (ID#61)



B) Alignment of TTH2 with alpha subunits of other organisms.

E.coli NKRRAKNGEPPLDIAAIPLDDKKSFDMLQRSETTAVFQLESRGMKD 618 (ID#79)

V.chol. NPRLKKAGKPPVRIEAIPLDDARSFRNLQDAKTTAVFQLESRGMKE 618 (ID#80)

H.inf. NVRMVREGKPRVDIAAIPLDDPESFELLKRSETTAVFQLESRGMKD 618 (ID#81)

R.prow. CKKLLKEQGIKIDFDDMTFDDKKTYQMLCKGKGVGVFQFESIGMKD 624 (ID#82)

H.pyl. LKIIKTQHKISVDFLSLDMDDPKVYKTIQSGDTVGIFQIES-GMFQ 648 (ID#83)

S.sp. QERKALQIRARTGSKKLPDDVKKTHKLLEAGDLEGIFQLESQGMKQ 643 (ID#84)

M.tub. IDNVRANRGIDLDLESVPLDDKATYELLGRGDTLGVFQLDGGPMRD 646 (ID#85)

T.th. RVELDYDALTLDD (ID#60)
A)

Start codon 

ATGGGCCGGGAGCTCCGCTTCGCCCACCTCCACCAGCACA 

CCCAGTTCTCCCTCCTGGACGGGGCGCCGAAGCTTTCCGA 

CCTCCTCAAGTGGGTGGAGGAGACGACCCCCGAGGACCCC 120 

GCCTTGGCCATGACCGACCACGGCAACCTCTTCGGGGCCG 

TAGAGTTCTACAAGAAGGCCGCCGAAATGGGCATCGAGCC 

CATCCTGGGTACGAGGCCTTACGTGGCGGCGGAAAGCCCG 240

TTTGACCGCAAGCGGGGAAAGGGCCTAGACGGGGGCTACT 

TTCACCTCACCCTCCTCGCCAAGGACTTCACGGGGTACCA 

GAACCTGGTGCGCCTGGCGAGCCGGGCTTACCTGGAGGGG 360

TTTTACGAAAAGCCCCGGATTGACCGGGAGATCCTGCGCG 

AGCGCCGAGGGCCTCATCGCCTCTCGGGGTGCCTCGGGGC 

GGAGATCCCCCAGTTCATCCTCCAGGACCGTCTGGACCTG 480 

GCCGAGGCCCGGCTCAACGAGGACCTCTCCATCTTCAAGG 

ACCGCTTCTTCATTCACATCCAGAACCACGGCCTCCCCGA 

GCAGAAAAAGGTCAACGAGGTCCTCAAGGAGTTCGCCCGA 600 

AAGTACGGCCTGGGGATGGTGGCCACCAACGACGGCCATT 

ACGGGAGGAAGGAGGCCCGCAGCGCCCACGAGGTTTTCCT 

CGCCATCCAGTCCAAGAGCACCCTGGACGACCCCGGGGCC 720 

GTTGGCTTTCCCCTGCGGGAGTTCTACGTGAAGACCCCCG 

AGGAGACGTGCGGGCCGGTGTTCCCCGAGGAGGAGTGGGG 

GGACGAGCCCTTTGACAACACCGTGGAGATCGCCCGCATG 840 

TGCAACGTGGAGCTGCCCATCGGGACAAGATGGTCTACCC 

GAATCCCCCGCTTCCCCCTCCCCGAGGGACCGGGGACCGA 

GGCCAAGTACCTAATGGAGCTAACCTTCAAGGGGCCCCTC 960 

CGCCGTTACCCGGACCGAATCACCGAGGGTTTCTACCGGG 

AGGTTTTCCGCCTTTTGGGGAAGCTTCCCCCCCACGGGCA 

CGGGGAGGCCTTGGCCGAGGCCTTGGCCCAGGTGGAGCGG 1080 

GAGGCTTGGGAGAGGCTCATGAAGAGCCTCCCCCCCTTTG 

ACCGGGGTCCAAGGAGTTCCA 



B) 

MGRELRFAHLHQHTQFSLLDGAPKLSDLLKWVEETTPEDP 
ALAMTDHGNLFGAVEFYKKAAEMGIEPILGTRPYVAAESP
FDRKRGKGLDGGYFHLTLLAKDFTGYQNLVRLASRAYLEG 120 
FYEKPRIDREILRERRGPHRLSGCLGAEIPQFILQDRLDL 

FFXEXQNHGLSEQK
AEARLNEDLSIFKDRFFIHIQNHGLPEQKKVNEVLKEFAR 
KYGLGMVATNDGHYGRKEARSAHEVFLAIQSKSTLDDPGA 240 
VGFPLREFYVKTPEETCGPVFPEEEWGDEPFDNTVEIARM 
CNVELPIGTRWSTRIPRFPLPEGPGTEAKYLMELTFKGPL 
RRYPDRITEGFYREVFRLLGKLPPHGHGEALAEALAQVER 360
EAWERLMKSLPPFDRGPRSS 
Start1 Start2 3'-Exo I
T.th. VERVVRTLLDGRFLLEEGVGLWEWRYPFPLEGEAVVVLDLETTGLAG LDEVIEVGLLRLEGG RRLPF

D.rad. PWPQDWVFDLETTGFSPA S AA I VE I GAVR I VGGQ I DETLKF

Bac.sub. HGIKMIYGMEANLVDDGVPIAYNAAHRLLEEETYWFDVETTGLSAV YDTIIELAAVKVKGGE--IIDKF

H.inf. MINPNRQIVLDTETTGMNQLGAHYEGHCI lEIGAVELINRR- YTGNNX

E.c. MSTAITRQIVLDTETTGMNQIGAHSEGHKIIEIGAVEVVNRR-LTGNNF
H.pyl. NLEYLKACGLNFIETSENLITLKNLKTPLKDEVFSFIDLETTGSCPI KHEILEIGAVQVKGGE--IINRF

3'-Exo II

T.th. QSLVR-PLPP AEARSWNLT GIPREALEEAPSLEEVLEKAYPLRGDATLVIHNAAFDLGFL-RPALEGLG

D.rad. ETLVR- PTRPDGSMLSIPWQAQRVHGISDEMVRRAPAXKDVLPDFFDFVDGSAWAHNVSFIXK3FM-RAGAERLG

Bac.sub. EAFAN-PHRP LSATIIELT GITDDMLQDAPDWDVIRDFREWIGDDILVAHNASFDMGFL-NVAYKKLL

H.inf. HIYIK-PDRP XDPDAIKVH GITDEMLADKPEFKEVAQDFLDYINGAELLIHNAPFDVGFM-DYEFRKLN

E.c. HVYLK-DRLV DPEAFGVH GIAVDFLLDKPTFAEVAVEFMDYIRGAELVIHNAAFDIGFM-DYEFSLLK

H.pyl. ETLVKVKSVP DYIAELT GITYEDTLNAPSAHEALQELRLFLGNSVFVAHNANFDYNFLGRYFVEKLH

3'-Exo IIIε

T.th. YRLENPVVDSLRLARRGLPGLRRYGLDALSEVLELPRRT--CHRALEDVERTLAVVHEVYYMLT SG

D.rad. LSWAPERELCTMQLSRRAFPRERTHNLTVLAERLGLEFAPGGRHRSYGDVQVTAQAYLRLLELLG ER

Bac.sub. E VEKAKNPVIDTLELGRFLYPEFKNHRLNTLCKKFDIELTQ--HHRAIYDTEATAYLLLKMLKDAA EK

H.inf. -LWKTDDICLVTDTLQMARQMYPGKRN-NLDALCDRLGIDNSKRTLHGALLDAEILADVYLMMTGGQTNLFDEEE

E.c. RDIAKTNTFCKVTDSLAVARKMFPGKRN-SLDALCARYEIDNSKRTLHGALLDAQILAEVYLAMTGGQTS

H.pyl. CPLLNLKLCTLDLSKRAILSMRY-SLSFLKELLGFGIEV--SHRAYADALASYKLFEICLLNLP--SYIKT
A)

ATGGTGGAGCGGGTGGTGCGGACCCTTCTGGACGGGAGGT 40
TCCTCCTGGAGGAGGGGGTGGGGCTTTGGGAGTGGCGCTA
CCCCTTTCCCCTGGAGGGGGAGGCGGTGGTGGTCCTGGAC 120
CTGGAGACCACGGGGCTTGCCGGCCTGGACGAGGTGATTG
AGGTGGGCCTCCTCCGCCTGGAGGGGGGGAGGCGCCTCCC 200
CTTCCAGAGCCTCGTCCGGCCCCTCCCGCCCGCCGAAGCC
CGTTCGTGGAACCTCACCGGCATCCCCCGGGAGGCCCTGG 280
AGGAGGCCCCCTCCCTGGAGGAGGTTCTGGAGAAGGCCTA
CCCCCTCCGCGGCGACGCCACCTTGGTGATCCACAACGCC 360
GCCTTTGACCTGGGCTTCCTCCGCCCGGCCTTGGAGGGCC
TGGGCTACCGCCTGGAAAACCCCGTGGTGGACTCCCTGCG 440
CTTGGCCAGACGGGGCTTACCAGGCCTTAGGCGCTACGGC
CTGGACGCCCTCTCCGAGGTCCTGGAGCTTCCCCGAAGGA 520
CCTGCCACCGGGCCCTCGAGGACGTGGAGCGCACCCTCGC
CGTGGTGCACGAGGTATACTATATGCTTACGTCCGGCCGT 600
CCCCGCACGCTTTGGGAACTCGGGAGGTAG



B) 

MVERVVRTLLDGRFLLEEGVGLWEWRYPFPLEGEAVVVLD 40
LETTGLAGLDEVIEVGLLRLEGGRRLPFQSLVRPLPPAEA
RSWNLTGIPREALEEAPSLEEVLEKAYPLRGDATLVIHNA 120
AFDLGFLRPALEGLGYRLENPVVDSLRLARRGLPGLRRYG
LDALSEVLELPRRTCHRALEDVERTLAVVHEVYYMLTSGR 200
PRTLWELGRZ 



Figure 18 




Alignment of dnaA genes.

P. mar. MLEASWEK VQSSL--KC^SK — PSYE TWIRPTEFSG — FKN GELTLIAPNSFSSAW LKNNYSQTIQETAE- 65 

Syn.sp. MVSCENLWQQ ALAIL--ATQLTK-- PAFD TWIKASVLIS— LGD GVATIQVENGFVLNH LQKSYGPLLMEVLT- 67 

B.SUt. MENILDLWNQ ALAQI— EKKLSK— PSFE TWMKSTKAHS— LQG DTLTITAPNEFARDW LESRYLHLIAOTIY- 67 

M.tub. MTDDPGSGFTTVWNA WSELNGDPKVDDGP SSDANLSAPLTPQQR AWLNLVQPLT — IVE GFALUSVPSSFVQNE lERHLRAPITDALS- 87 

T.th. MSHEAVWQH VLEHI--RRSITE-- VEFH TWFERIRPLG--IRD GVLELAVPTSFALDW IRRHYAGLIQEGPR- 66

E.coli MSLSLWQQ CLARL— QDELPA — TEFS MWIRPLQAE LSD NTLALYAPNRFVU3W VRDKYLNNINGLLT- 64 

T.mar. MKER ILQEI— KTRVNR— KSWE LWFSSFDVKS— lEG NKWFSVGNLFIKEW LEKKYYSVLSKAVK- 61 

H.pyl. MOTMNNIEKE ILALVKQNPKVSL- lEYE NYFSQLKYNPNASKS DIAFFYAPNQVLCTT ITAKYGALLKEILSQ 72 

P. mar. EIPG EPVTVHVK VKANAESSDEHYSSA P ITPPLEASPGSV DSSGSSLRLSK -KTLPLLNLRYVFNR 130 

Syn.sp. DLTG QEITVKLI TDGLEPHS LIGQ E SSLPMETTP -KNATALNGKYTFSR 115 

B.SUt. ELTG EELSIKFV IPQNQDVEDFMPKPQ VKKAVKEDTSDFPQN MLNPKYTFOT 119 

M.txib. RKLGH-QIQIiGVRIA PPATDEADEOTVPPS ENPATTSPDTTTDND EIDDSAAARGDNQHS WPSYFTERPHNTDSA TAGVTSLNRRVTFET 176 

T.th. LLGAQ-APRFELRVV PGVVVQEDIFQPPPS PPAQAQP EDTFKT 108

E.coli SFCGADAPQLRFEVG TKPVTQTPQAAVTSN VAAPAQVAQTQPQRA APSTRSGWDNVPAPA EP -TYRSNVNVKHTFDN 140 

T.mar. WLG NDATFEIT YEAFEPHSSYSEPLV KKRAVLLTP LNPDYTFEN 106 

H.pyl. NKVG-MHLAHSVDVR lEVAPKIQINAQSNI NYKAIKTS VKDSYTFEN 118 

:P=.inar. FWGPNSRMAHAAAM AVAESPGREFNPLFI CGGVGLGKTHLMQAI GHYRLEIDPGAKVSY VSTETFTNDLIL — A IRQDRMQAFRDRYR- 217 

i^lri.sp. FWGPTNRMAHAASL AVAESPGREFNPLFL CGGVGLGKTHLMQAI AHYRLEMYPNAKVYY VSTERFTNDLIT— A IRQDNMEDFRSYYR- 202 

Bf.SUt. FVIGSOJRFAHAASL AVAEAPAKAYNPLFI YGGVGLGKTHIMHAI GHYVIDHNPSAKWY LSSEKFTNEFIN~S IRDNKAVDFRNRYR- 206 

^wltub. FVIGASNRFAHAAAL AIAEAPARAYNPLFI WGESGLGKTHLLHAA GNYAQRLFPGMRVKY VSTEEFTNDFIN— S LRDDRKVAFKRSYR- 263 

T.th. SWWGPTTPWPHGGAV AVAESPGRAYNPLFI YGGRGLGKTYLMHAV GPLRAKRFPHMRLEY VSTETFTNELINRPS AR-DRMTEFRERYR- 196

l^iCOli FVEGKSNQLARAAAR QVADNPGGAYNPLFL YGGTGLGKTHLLHAV GNGIMARKPNAKWY MHSERFVQDMVK--A LQNNAIEEFKRYYR- 227 

Lrf'-^mar. FWGPO^SFAYHAAL EVAKHPGR-YNPLFI YGGVGIiGKTHLLQSI OJYWQNEPDLRVMY ITSEKFLNDLVD--S MKEGKLNEFREKYRK 193 

WIpyl. FWGSCNNTVYEIAK KVAQSDTPPYNPVLF YGGTGLGKTHILNAI GNHALEK— HKKWL VTSEDFLTDFLK— H LDNKTMDSFKAKYR- 203 

P'.mar. AADLILVDDIQFIEG KEYTQEEFFHTFNAL HDAGSQIVLASDRPP SQIPRLQERLMSRFS ^fGLIADVQAPDLETR MAILQKKAEHERVGL 307 

'iSyn.sp. SADFLLIDDIQFIKG KEYTQEEFFHTFNSL HEAGKQVWASDRAP QRIPGLQDRLISRFS MGLIADIQVPDLETR MAILQKKAEYDRIRL 292 

'Sfsut. NVDVLLIDDIQFLAG KEC3TQEEFFHTFOTL HEESKQIVISSDRPP KEIPTLEDRLRSRFE WGLITDITPPDLETR lAILRKKAKABGLDI 296 

iitiltub. DVDVLLVDDIQFIEG KEGIQEEFFHTFNTL HNANKQIVISSDRPP KQLATLEDRLRTRFE WGLITDVQPPELETR lAILRKKAQMERLAV 353 

T.th. SVDLLLVDDVQFIAG KERTQEEFFHTFNAL YEAHKQIILSSDRPP KDILTLEARLRSRFE WGLITDNPAPDLETR IAILKMNAS-SGPED 285

jE-^coli SVDALLIDDIQFFAN KERSQEEFFHTFNAL LEGNQQIILTSDRYP KEINGVEDRLKSRFG WGLTVAIEPPELETR VAILMKKADENDIRL 317 

Xjmar. KVDILLIDDVQFLIG KTGVQTELFHTFNEL HDSGKQIVICSDREP QKLSEFQDRL.VSRFQ MGLVAKLEPPDEETR KSIARKMLEIEHGEL 283 

^Hfipyl. HCDFFLLDDAQFLQG KPKLEEEFFHTFNEL HANSKQIVLISDRSP KNIAGLEDRLKSRFE WGITAKVMPPDLETK LSIVKQKCQLNQITL 293 

P.rnar. PRDLIQFIAGRFTSN IRELEGALTRAIAFA SITGLPinVDSIAPM LD PNGQGVEVT PKQVLDKVAEVFKVT PDEMRSASRRR-PVS 392 

Syn.sp. PKEVIEYIASHYTSN IRELEGALIRAIAYT SLSNVAMTVENIAPV LN PPVEKVAAA PETIITIVAQHYQLK VEELLSNSRRR-EVS 377 

B.SUt. PNEVMLYIANQIDSN IRELEGALIRWAYS SLINKDINADIAAEA LKDII-PSSKPKVIT IKEIQRWGQQFNIK LEDFKAKKRTK-SVA 384 

M.tub. PDDVLELIASSIERN IRELEGALIRVTAFA SLNKTPIDKALAEIV LRDLI-ADANTMQIS AATIMAATAEYFDTT VEELRGPGKTR-ALA 441 

T.th. PEDALEYIARQVTSN IREWEGALMRASPFA SLNGVELTRAVAAKA LRHLR-P--RELEAD PLEIIRKAAGPVRPE TPGGAHGERRKKEVV 372

E.coli PGEVAFFIAKRLRSN VRELEGALNRVIANA NFTGRAITIDFVREA LRDLL-A-LQEKLVT IDNIQKTVAEYYKIK VADLLSKRRSR-SVA 404 

T.mar. PEEVLNFVAENVDDN LRRIiRGAI IKLLVYK ETTGKEVDLKEAILL LKDFIKPNRVKAMDP IDELIEIVAKVTGVP REEILSNSRNV-KAL 372 

H.pyl. PEEVMEYIAQHISDN IRQMEGAIIKISVNA NLMNASIDLNLAKTV LEDL — QKDHAEGSS LENILLAVAQSLNLK SSEIKVSSRQK-NVA 380 



P.rnar, QARQVGKYLMRQGTN LSLPRIGDTFGGKDH TTVMYAIEQVEKKLS S DPQIA SQVQKIRDLLQIDSR RKR 461 

Syn.sp. LARQVG^f^LMRQHTD LSLPRIGEAFGGKDH TTVMYSCDKITQLQQ K DWETS QTLTSLSHRINIAGQ APES 447 

B.SUt. FPRQIAMYLSREMTD SSLPKIGEEFGGRDH TTVIHAHEKISKLLA D DEQLQ QHVKEIKEQLK 446 

M.tub. QSRQIAMYLCRELTD LSLPKIGQAFG-RDH TTVMYAQRKILSEMA E RREVF DHVKELTTRIRQRSK R 507 

T.th. LPRQLAMYLVRELTP ASLPEIGQLFGGRDH TTVRYAIQKVQELAG KP DREVQ GLLRTLREACTDPVD NLWITCG 446 

E.coli RPRQMAMALAKELTN HSLPEIGDAFGGRDH TTVLHACRKIEQLRE E SHDIK EDFSNLIRTLSS 467 

T.mar. TARRIGMYVAKNYLK SSLRTIAEKFN-RSH PWVDSVKKVKDSLL KG NKQLK ALIDEVIGEISRRAL SG 440 

H.pyl. LARKLWYFARLYTP NPTLSLAQFLDLKDH SS I SKMYSGVKKMLE EEKSPFVLSLREEIK NRLNELNDPOCTAFMS SE 457 
GTGTCGCACGAGGCCGTCTGGCAACACGTTCTGGAGCACA

TCCGCCGCAGCATCACCGAGGTGGAGTTCCACACCTGGTT 

TGAAAGGATCCGCCCCTTGGGGATCCGGGACGGGGTGCTG 120 

GAGCTCGCCGTGCCCACCTCCTTTGCCCTGGACTGGATCC 

GGCGCCACTACGCCGGCCTCATCCAGGAGGGCCCTCGGCT 

CCTCGGGGCCCAGGCGCCCCGGTTTGAGCTCCGGGTGGTG 240 

CCCGGGGTCGTAGTCCAGGAGGACATCTTCCAGCCCCCGC 

CGAGCCCCCCGGCCCAAGCTCAACCCGAAGATACCTTTAA 

AACTTCGTGGTGGGGCCCAACAACTCCATGGCCCCACGGC 360

GGCGCCGTGGCCGTGGCCGAGTCCCCCGGCCGGGCCTACA 

ACCCCCTCTTCATCTACGGGGGCCGTGGCCTGGGAAAGAC 

CTACCTGATGCACGCCGTGGGCCCACTCCGTGCGAAGCGC 480

TTCCCCCACATGAGATTAGAGTACGTTTCCACGGAAACTT 

TCACCAACGAGCTCATCAACCGGCCATCCGCGAGGGACCG 

GATGACGGAGTTCCGGGAGCGGTACCGCTCCGTGGACCTC 600 

CTGCTGGTGGACGACGTCCAGTTCATCGCCGGAAAGGAGC 

GCACCCAGGAGGAGTTTTTCCACACCTTCAACGCCCTTTA 

CGAGGCCCACAAGCAGATCATCCTCTCCTCCGACCGGCCG 720

CCCAAGGACATCCTCACCCTGGAGGCGCGCCTGCGGAGCC 

GCTTTGAGTGGGGCCTGATCACCGACAATCCAGCCCCCGA 

CCTGGAAACCCGGATCGCCATCCTGAAGATGAACGCCAGC 840

AGCGGGCCTGAGGATCCCGAGGACGCCCTGGAGTACATCG 

CCCGGCAGGTCACCTCCAACATCCGGGAGTGGGAAGGGGC 

CCTCATGCGGGCATCGCCTTTCGCCTCCCTCAACGGCGTT 960 

GAGCTGACCCGCGCCGTGGCGGCCAAGGCTCTCCGACATC 

TTCGCCCCAGGGAGCTGGAGGCGGACCCCTTGGAGATCAT 

CCGCAAAGCGGCGGGACCAGTTCGGCCTGAAACCCCGGGA 1080 

GGAGCTCACGGGGAGCGCCGCAAGAAGGAGGTGGTCCTCC 

CCCGGCAGCTCGCCATGTACCTGGTGCGGGAGCTCACCCC 

GGCCTCCCTGCCCGAGATCGACCAGCTCAACGACGACCGG 1200 

GACCACACCACGGTCCTCTACGCCATCCAGAAGGTCCAGG 

AGCTCGCGGAAAGCGACCGGGAGGTGCAGGGCCTCCTCCG 

CACCCTCCGGGAGGCGTGCACATGA 



B) 

VSHEAVWQHVLEHIRRSITEVEFHTWFERIRPLGIRDGVL 

ELAVPTSFALDWIRRHYAGLIQEGPRLLGAQAPRFELRVV

PGVVVQEDIFQPPPSPPAQAQPEDTFKTSWWGPTTPWPHG 120

GAVAVAESPGRAYNPLFIYGGRGLGKTYLMHAVGPLRAKR 

FPHMRLEYVSTETFTNELINRPSARDRMTEFRERYRSVDL 

LLVDDVQFIAGKERTQEEFFHTFNALYEAHKQIILSSDRP 240 

PKDILTLEARLRSRFEWGLITDNPAPDLETRIAILKMNAS 

SGPEDPEDALEYIARQVTSNIREWEGALMRASPFASLNGV 

ELTRAVAAKALRHLRPRELEADPLEIIRKAAGPVRPETPG 360 

GAHGERRKKEVVLPRQLAMYLVRELTPASLPEIDQLNDDR

DHTTVLYAIQKVQELAESDREVQGLLRTLREACT 
ATGAACATAACGGTTCCCAAAAAACTCCTCTCGGACCAGC 40
TTTCCCTCCTGGAGCGCATCGTCCCCTCTAGAAGCGCCAA 
CCCCCTCTACACCTACCTGGGGCTTTACGCCGAGGAAGGG 120 
GCCTTGATCCTCTTCGGGACCAACGGGGAGGTGGACCTCG 
AGGTCCGCCTCCCCGCCGAGGCCCAAAGCCTTCCCCGGGT 200 
GCTCGTCCCCGCCCAGCCCTTCTTCCAGCTGGTGCGGAGC 
CTTCCTGGGGACCTCGTGGCCCTCGGCCTCGCCTCGGAGC 280 
CGGGCCAGGGGGGGCAGCTGGAGCTCTCCTCCGGGCGTTT 
CCGCACCCGGCTCAGCCTGGCCCCTGCCGAGGGCTACCCC 360
GAGCTTCTGGTGCCCGAGGGGGAGGACAAGGGGGCCTTCC 
CCCTCCGGACGCGGATGCCCTCCGGGGAGCTCGTCAAGGC 440 
CTTGACCCACGTGCGCTACGCCGCGAGCAACGAGGAGTAC 
CGGGCCATCTTCCGCGGGGTGCAGCTGGAGTTCTCCCCCC 520 
AGGGCTTCCGGGCGGTGGCCTCCGACGGGTACCGCCTCGC 
CCTCTACGACCTGCCCCTGCCCCAAGGGTTCCAGGCCAAG 600
GCCGTGGTCCCCGCCCGGAGCGTGGACGAGATGGTGCGGG 
TCCTGAAGGGGGCGGACGGGGCCGAGGCCGTCCTCGCCCT 680 
GGGCGAGGGGGTGTTGGCCCTGGCCCTCGAGGGCGGAAGC 
GGGGTCCGGATGGCCCTCCGCCTCATGGAAGGGGAGTTCC 760 
CCGACTACCAGAGGGTCATCCCCCAGGAGTTCGCCCTCAA 
GGTCCAGGTGGAGGGGGAGGCCCTCAGGGAGGCGGTGCGC 840 
CGGGTGAGCGTCCTCTCCGACCGGCAGAACCACCGGGTGG 
ACCTCCTTTTGGAGGAAGGCCGGATCCTCCTCTCCGCCGA 920 
GGGGGACTACGGCAAGGGGCAGGAGGAGGTGCCCGCCCAG 
GTGGAGGGGCCGGACATGGCCGTGGCCTACAACGCCCGCT 1000 
ACCTCCTCGAGGCCCTCGCCCCCGTGGGGGACCGGGCCCA 
CCTGGGCATCTCCGGGCCCACGAGCCCGAGCCTCATCTGG 1080 
GGGGACGGGGAGGGGTACCGGGCGGTGGTGGTGCCCCTCA 
GGGTCTAG 1128 



B) 

MNITVPKKLLSDQLSLLERIVPSRSANPLYTYLGLYAEEG 40 
ALILFGTNGEVDLEVRLPAEAQSLPRVLVPAQPFFQLVRS 
LPGDLVALGLASEPGQGGQLELSSGRFRTRLSLAPAEGYP 120 
ELLVPEGEDKGAFPLRTRMPSGELVKALTHVRYAASNEEY 
RAIFRGVQLEFSPQGFRAVASDGYRLALYDLPLPQGFQAK 200 
AVVPARSVDEMVRVLKGADGAEAVLALGEGVLALALEGGS
GVRMALRLMEGEFPDYQRVIPQEFALKVQVEGEALREAVR 280
RVSVLSDRQNHRVDLLLEEGRILLSAEGDYGKGQEEVPAQ
VEGPDMAVAYNARYLLEALAPVGDRAHLGISGPTSPSLIW 360
GDGEGYRAVVVPLRVZ
T.th.beta MNITVPKKLLSDQLSLLERIVPSRSANPLYTYLGLYAEEG^

E . col i . be t MKPTVEREHLIJCPUJQVSGPI^RPTLPILGWI^ 

P . mirab . be MKFI IEREQIJLJCPI^VSGPIX3GRPTLPII/3NIXUCVTE3^ 

H . inf 1 , bet MQFSISRENI^PLQQWGVLSmPNIPVIMWIJ^IEDY^ 

P . put . beta MHFTIQREAIXKPLQLVAGWVERRQTLPVLS^A/IZ.WQGQ^ 

B . cap . beta MKFTIQNDILTKmjaCITRVLVKNISFPII£NILIQVEIX5TLSLT^^ 

* * * * * * * 

T.th.beta AQSLP-RVLVPAQPFFQLVRSLPGDLVALGLASEPGQGGQLELSSGRFRTRLSLAPAEGY

E . CO 1 i . be t QPHEPGATTVPARKFFDICRGLP-EGAEIAVQLE GERMLVRSGRSRFSLSTLPAADF 

P . mirab . be QSHEIGATTVPARKFFDIWRGLP-EGAEISVELD GDRLLVRSGRSRFSLSTLPASDF 

H . inf 1 . bet SSSENGTFTIPAKKFLDICRTLS-DDSEITVTFE QDRALVQSGRSRFTLATQPAEEY 

P . put . beta EPAEPGEITVPARKLMDICKSLP-NDALIDIKVD EQKLLVKAGRSRFTLSTLPANDF 

B . cap . beta TKYIPGKTTISGRKILNICRTLS-EKSKIKMQLK NKKMYISSENSNYILSTLSADTF 

* * * 

T.th.beta PELLVPEGEDKGAFPLRTRMPSGELVKALTHVRYAASNEEYRAIFRGVQLEFSPQGFRAV

E . COli .bet PNLDD — WQSEVEFTLPQAT MKRLIEATQFSMAHQDWYYI2«]MLFETEGEEIAW 

P . mirab . be PNLDD — WQSEVEFTLPQAT LKRLIESTQFSMAHQDTOYYLNGMLFETENTELRTV 

H . inf 1 . bet PNLTD — WQSEVDFELPQNT LRRLIEATQFSMANQDARYFLNGMKFETEGNLLRTV 

P . put . beta PTVEE — GPGSLTCNLEQSK LRRLIERTSFAMAQQDVRYYLNGMLLEVSROTLRAV 

B. cap. beta PNHQN--FDYISKFDISSNI LKEMIEKTEFS^GKQDVRYYL^^GMLLEKKDKFLRSV 

# ***** 

T.th.beta ASDGYRLALYDLPLPQGFQA--KAVVPARSVDEMVRVLKGADGAEAVLALGEGVLALALE

E . coli . bet ATDGHRLAVCSMPIGQSLPS-HSVWPRKGVIELMRMLDG-GDNPLRVQIGSNNIRAHVG 

P . mirab . be ATDGHRLAVCAMDIGQSLPG-HSV WPRKGVIELMRLLDGSGESLLQLQIGSNNLRAHVG 

H . inf 1 . bet ATDGHRLAVCTISLEQELQN-HSVILPRKGVLELVRLLET-NDEPARLQIGT^mJW^^ 

P . put . beta STDGHRLALCSMSAPIEQEDRHQVIVPRKGILELARLLTD-PEGMVSIVLGQHHIRATTG 

B . cap . beta ATDGYRLAISYTQLKKDI^IF-FSIIIPNKAVMELLKLL^P^-QPQLLNILIGSN^ 
***** *** * 

T.th.beta GGSGVRMALRLMEGEFPDYQRVIPQEFALKVQVEGEALREAVRRVSVLSDRQNHRVDLLL

E . coli . bet DFIFTSKLVDGRFPDYRRVLPKNPDKHLEAGCDLLKQAFARAAILSNEKFRGVRLYV 

P . mirab . be DFIFTSKLVDGRFPDYRRVLPKNPTKTVIAGCDILKQAFSRAAILSNEKFRGVRINL 

H . inf 1 . bet ^m/FTSKLIDGRFPDYRRVLPRmTKIVEGNWEMLKQAFARASILSNERARSV^ 

P . put . beta EFTFTSKLVDGKFPDYERVLPKGGDKLWGDRQALREAFSRTAILSNEKYRGIRLQL 

B , cap . beta ^^^IFTTQLIEGEYPDYKSVLFKEKKNPIITNSILLKKSLLRVAILAHEKFCGIEIKI 

^ * * *** * * * * 

T.th.beta EEGRILLSAEGDYGK-GQEEVPAQVEGPDMAVAYNARYLLEALAPVG-DRAHLGISGPTS

E . CO 1 i . bet SENQLKITANNPEQEEAEEIIJ3VTYSGAEMEIGFNVSYVLDVLt^^ 

P . mirab . be TNGQLKITAl^PEQEEAEEIVDVQYQGEEMEIGFNVSYLLD\n^mTJCCEEV^ 

H . inf 1 .bet KENQLKITASmCTEEAEEIVDVirasrGEELEVGFNVTYILDV^^ 

P . put . beta AAGQLKIQANNPEQEEAEEEISVDYEGSSLEIGFWSYLLDVLGVMITE^ 

B , cap . beta ENGKFKVLSDNQEEETAEDLFEIDYFGEKIEISI^A/YYLLDVINNIKSENIALFLNKSKS 



T.th.beta PSLIWGDG-EGYRAVVVPLRVZ (ID#108)

E.coli.bet SVQIEDAASQSAAYVVMPMRL- (ID#109)

P.mirab.be SVQVENVASAAAAYVVMPMRL- (ID#110)

H.infl.bet SCLIENCEDSSCEYVIMPMRL- (ID#111)

P.put.beta SALLQEAGNDDSSYVVMPMRL- (ID#112)

B.cap.beta SIQIEAENNSSNAYVVMLLKR- (ID#113)